The Multimodal Listening Test in a High-Stakes Context: Gender-Neutral or not?
Authors
Abstract
In this study, we used Rasch measurement to investigate the fairness of the listening section of a national computerized high-stakes English test by examining differential item functioning (DIF) across gender subgroups. The format prompted us to examine whether the items measure comprehension differently for females and males. Exploring novel task types, including multimodal materials such as videos and pictures, was especially interesting. Firstly, unidimensionality and local independence of the data were examined as preconditions for DIF analysis. Secondly, the authors explored the performance of female and male students through DIF analysis using Rasch measurement. The uniform DIF analysis showed that 25 (out of 30) items displayed DIF and favored different subgroups, whereas the effect size was not meaningful. The non-uniform DIF analysis revealed several items exhibiting DIF with moderate to large effect size, favoring various ability groups. Explanations for the DIF are hypothesized. Finally, implications of the study regarding test development are discussed.
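The distinction the abstract draws between items that display DIF and items whose effect size is meaningful can be illustrated with a minimal sketch. It assumes that Rasch item difficulties (in logits) and their standard errors have already been estimated separately for each gender subgroup. The function names, the example values, and the 0.43 and 0.64 logit cutoffs (conventional ETS-style thresholds) are assumptions made for illustration and are not taken from the study.

```python
import math

def dif_contrast(d_focal, se_focal, d_ref, se_ref):
    """Uniform DIF for one item: difficulty difference (logits) and its t statistic.

    Inputs are hypothetical per-group Rasch difficulty estimates and standard errors.
    """
    contrast = d_focal - d_ref
    joint_se = math.sqrt(se_focal ** 2 + se_ref ** 2)
    return contrast, contrast / joint_se

def classify_dif(contrast, t, crit=1.96, slight=0.43, large=0.64):
    """Label effect size; cutoffs are assumed conventional values, not the study's."""
    size = abs(contrast)
    if abs(t) < crit or size < slight:
        return "negligible"
    if size < large:
        return "slight to moderate"
    return "moderate to large"

# Hypothetical item: statistically detectable DIF, but a small contrast.
contrast, t = dif_contrast(0.35, 0.05, 0.10, 0.05)
label = classify_dif(contrast, t)
```

In this hypothetical case the t statistic exceeds 1.96 because the standard errors are small, yet the 0.25 logit contrast falls below the assumed 0.43 cutoff, so the item is flagged by the significance test while its effect size is labeled negligible.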
Similar resources
Differential Item Functioning (DIF) in Terms of Gender in the Reading Comprehension Subtest of a High-Stakes Test
Validation is an important enterprise especially when a test is a high stakes one. Demographic variables like gender and field of study can affect test results and interpretations. Differential Item Functioning (DIF) is a way to make sure that a test does not favor one group of test takers over the others. This study investigated DIF in terms of gender in the reading comprehension subtest (35 i...
Assessing Assessment Literacy: Insights From a High-Stakes Test
This study constitutes an attempt to see what language assessment literacy (LAL) is for three groups of stakeholders, namely LAL test developers, LAL instructors, and LAL test-takers. The perceptions of the former group were derived from the content analysis of the latest version of the LAL test, and those of the latter 2 groups were assessed through a survey designed by the researcher. Participant...
Interpreting the Validity of a High-Stakes Test in Light of the Argument-Based Framework: Implications for Test Improvement
The validity of large-scale assessments may be compromised, partly due to their content inappropriateness or construct underrepresentation. Few validity studies have focused on such assessments within an argument-based framework. This study analyzed the domain description and evaluation inference of the Ph.D. Entrance Exam of ELT (PEEE) sat by Ph.D. examinees (n = 999) in 2014 in Iran....
Gender, Spatial Ability, and High-Stakes Testing
Researchers disagree on the relationships between gender, spatial ability and math achievement. Varied results from studies using different measures and populations fuel the debate. The present study adds to the gender-spatial-math literature by examining this relationship in the context of high-stakes math testing. Results indicate no gender effect on spatial ability or math achievement, and a...
Journal
Journal title: International Journal of Listening
Year: 2022
ISSN: 1090-4018, 1932-586X
DOI: https://doi.org/10.1080/10904018.2021.1993446